A new bound for the smallest x with 
tt(x) > li(x 



Kuok Fai Chao and Roger Plymen 



School of Mathematics, Alan Turing building, Manchester University 
Manchester M13 9PL, England 

kchao@maths.manchester.ac.uk, plymen@manchester.ac.uk 



Abstract 

We reduce the leading term in Lehman's theorem. This improved 
estimate allows us to refine the main theorem of Bays & Hudson [2]. 
Entering 2, 000, 000 Riemann zeros, we prove that there exists x in the 
interval [exp(727.951858) , exp(727.952178)] for which ir(x) - li(x) > 
3.2 x 10 . There are at least 10 154 successive integers x in this 
interval for which ir(x) > This interval is strictly a sub-interval 

of the interval in Bays & Hudson, and is narrower by a factor of about 
12. 

Keywords: Riemann zeros, Skewes number, primes, logarithmic integral, 
first crossover. 
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1 Introduction 

Let ir(x) denote the number of primes less than or equal to x, and let li(a;) 
denote the logarithmic integral. The notation f(x) = Q±g(x) means that 

lim sup f(x)/g(x) > 0, lim inf f(x)/g(x) < 

There was, in 1914, overwhelming numerical evidence that ir(x) < li(x) for 
all x. In spite of this, Littlewood [9J announced that 

7r(x) — li(x) = fi±(x 1/ ' 2 (logx)~ 1 log log log a;) 
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This implies that tt(x) — li(x) changes sign infinitely often. Littlewood's 
method provided, even in principle, no definite number X before which 
7r(x) — li(x) changes sign. For a recent proof of Littlewood's theorem, see 
[TT1 Theorem 15.11]. 

In the course of the 20th century, successive numerical upper bounds were 
found by Skewes [UJ , Skewes [IE] , Lehman [H] , te Riele P2] . For Littlewood's 
own account of the discovery of the Skewes numbers, see [10, p. 110-112]. 

The smallest value of x with ir(x) > li(x) will be denoted H, as in the 
recent paper by Kotnik [5]. In the course of a systematic computational 
study, Kotnik proves that 

10 14 < S. 

We now explain the main idea in [5]. Lehman's theorem is an integrated 
version of the Riemann explicit formula. His method was to integrate the 
function u \— > Tr(e u ) — li(e n ) against a Gaussian kernel over a carefully chosen 
interval [u — rj, u + rj\. The definite integral so obtained is denoted I{uj,rj). 
Let p = 1/2 + ij denote a Riemann zero with 7 > and let 

#(!»:= -23? —e^ 2 / 20 . 

0<7<T P 

The a in this formula is related to the kernel chosen. Lehman proved the 
following equality 

I{w,rj) = -1+H{T,uj) + R 
together with an explicit estimate \R\ < e. This creates the inequality 

I(u,r})>H(T,u)-(l + e). 

The problem now is to prove that 

H(T,u)>l + e. (1) 

If ([T]) holds, then I(u,T]) > and so there exists x e [e^ - ^, e w+r? ] for which 
7r(x) > li(x). In order to establish (TjQ), numerical values of the Riemann 
zeros with I7I < T are required. Each term in H(T,u) is a complex num- 
ber determined by a Riemann zero. It is necessary that the real parts of 
these complex numbers, which are spiralling towards 0, reinforce each other 
sufficiently for (TjQ) to hold. The only known way of establishing this is by 
numerical computation. When T is large, this requires a computer. 
In 2000, Bays & Hudson [2] made the following selection: 

uj = 727.95209, 77 = 0.002. 
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The interval itself is [exp(727.95009),exp(727.95409)]. We note that the nu- 
merical value of exp(727.95409) is incorrectly stated in [21 Theorem 2]. 

In this article, we reduce the leading term in Lehman's theorem. This 
enables us to select the following parameters: 

uj = 727.952018, r) = 0.00016. 

The interval is [exp(727.951858), exp(727. 952178)] and so an upper bound 
for the first crossover 3 is 

3 < exp(727.952178) < 1.398344 x 10 316 

Our interval is strictly a sub-interval of the Bays-Hudson interval. It 
is narrower by a factor of about 12, and creates the smallest known upper 
bound. 

The function H(T,u) is an initial part of the series 

H{uo) := -23? V — e -r72a 
p 

0<7 F 

As Rademacher observed in 1956 [H], the Riemann Hypothesis plus 
Weyl's criterion imply that, for each to > 0, the sequence 

{exp(z 7 cj) : C(l/2 + ij) = 0, 7 > 0} 

is equidistributed in the unit circle. So we may expect a fair amount of 
cancellation to take place in the series H(u). This may help to explain why 
it is so difficult to find a number u for which H(T,u) exceeds 1. 

We reflect, for a moment, on the Weil explicit formula. This is an iden- 
tity between two distributions [TBI p. 39]. It is well established that certain 
classical explicit formulas follow from the Weil explicit formula, by picking 
suitable test-functions. For example, classical formulas for Dirichlet L-series 
may be derived in this way, see P, Theorem 3.2, p. 340]. We are led to ask 
whether the Lehman formula can be obtained from the Weil explicit formula 
by picking a suitable test function. We hope to pursue this idea elsewhere. 

We would like to thank Andrew Odlyzko for supplying us with the first 
2,000,000 Riemann zeros, Jon Keating for drawing our attention to [14], 
Christine Lee for improvements in the exposition, Aleksandar Ivic for drawing 
our attention to the inequalities of Panaitopol [12], and Nick Gresham for 
assistance with numerical calculations. Finally, we thank the referee for his 
many detailed and constructive comments. 
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2 The Leading Term 

We begin this section with Lehman's theorem. 

Theorem 2.1. (Lehman fBf) Let A be a positive number such that = \ for 
all zeros p = [3 + ry of ((s) for which < 7 < A. Let a, r\ and to be positive 
numbers such that u — rj > 1 and 

2/A < 2A/a < 77 < lu/2. (2) 

Let 

1(00,7]) := / +V K(u - uj)ue- u/2 {Tt(e u ) - li(e u )}du 

J UJ — T) 

Then for 2ire < T < A 

I(oj, V ) = -1- J2 —e-~< 2/2a + R 

0<|7l<r ^ 



where 
with 



\R\ < s 1 + s 2 + s 3 + s 4 + s 5 + s 6 



3.05 



si 
s 3 



UJ — 7] 

4(cu + r/)e- ( "-" )/6 

2e -ar, 2 /2 



s 4 = 0.08^e- ar > 2/2 

8* = e - T2 / 2 n^io S - + ^^ + -i 

5 VT 2 g 2vr + T + T3 j 

s 6 = A\ogAe- A2/2a+ ^ )/2 {Aa- l/2 + 15^} 

// £/ie Riemann Hypothesis holds, then conditions (dp and i/ie term S6 m t/ie 
estimate for R may be omitted. 

For the rest of the paper p = (3 + 27 will denote a zero of the Riemann 
zeta function ((s) for which < /3 < 1. We will refine a part of Lehman's 
proof. This allows us to reduce the term s\ in Lehman's theorem. 
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The logarithmic integral is defined as follows [4, p. 82]: 

/x+iy p t 
-dt 
-oo+iy " 

where z = x + iy, y ^ 0. For x > 1, li(x) is then defined as follows: 

li(x) := ^[li(x + iO) + li(x - i0)] 

In this way, we recover the classical definition of li(x) as an integral principal 
value [U p.82]: 

\i(x) = lim ( / — dt + / — dt 

1 ' ~o Wo t J 1+e t 

For a detailed account of the logarithmic integral, see [TJ p. 38-41]. 
We define 

J{x) := n{x) + ^vr(x 1/2 ) + ^vr(x 1/3 ) . . . 
and recall the Riemann-von Mangoldt explicit formula: 

J(x) = \i(x) - V li(z>) + . 2 f " : log 2 

w w £^ \ / (u 2 -l)u\ogu 



(3) 



valid for x > 1. According to P2J (3.2) and (3.6), p. 69] we have, for all x > 1, 

. , x 3i?i(x)x/2 

^ = i + 7T ^ 

logx (logx)^ 

, . §2{x)2x fi . 

<x) = r 4 

logx 

with |i?i(x)| < 1, $2(2^) < 0.62753. There are at most [(log x) / log 2] terms 
in J(x). This allows us implicitly to define $3 (2) by the following equation: 

J(x) = vr(x) + \{x^) + hhlr)*&l*) (j^j (5) 

Then we have ^(x) < 1. Combining (4) and (6), we have 

tt(x) - li(x) = - V li(x') + /°° . - ^ log 2 

-V /2 )--^(xWi i/3 ) f!^ 

2 v ; 3 V ; V ; V!og 2 
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Substituting the first expression in (j3J) for ^(x 1 / 2 ) and the second expression 
for 7r(x 1//3 ), we get 

vr(x) - li(x) = - J2 + r (2 TV 1 a - bg 2 

p 7x (V-lJwlogW 

f x 1 ' 2 + g ^ 1 (x 1 / 2 )x 1 / 2 \ _ ^ / ^ 2 (x 1 / 3 )2x 1 / 3 \ 
\logx (logx) 2 / \ log2 J 

Let 

^(x) := f°° 7 5 , lo S 2 



For x > 2 we have the following bounds: 

-log 2 < 4 (ac) < 1/2 - log 2. 

We now have 



tt(x) - li(x) = - J>( 



x p ) 



logx 



+ 3 7j ^ V 1/4 (x) 

(logx) 2 log 2 

Now define $(x) as follows: 

4tf(x)x 1/3 := 4 (z) - JV ; f i 

w w log2 

Then we have 

|tf(x)| < 4- x x- 1/3 (l/2 - log 2) + 1/2 < 1 - log 2 < 1 

for all x > 2. Here is where our method differs from Lehman's approach: we 
keep the estimate d\{x) separate from d(x). We have 

tt(x) - li(x) = Y li(x") - + 3 ^( xl/2 \f 2 + #( X )4 X W ( 6 ) 

logx (logx) 2 

Now we improve the bound for i?i(x). We quote a result of Panaitopol [T2"l 
Theorem 1]: 

x 

7r(x) < ' r— TT for all x > 6 (7) 

v 7 logx - 1 - (logx)" - 5 v ' 
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From (j3J) and ((7|), we get 



+ 2 . o < 



.r 



logx log 2 x " log x — 1 — (logx) 5 

Denote y = := (log x) 2. The inequality (jHJ) will lead to an upper bound 
for i?i(a;): 



We define 



< i9i(x) < - ■ I +V for all x > 6. 
3 r - y - 1 

y 3 + ?/ 2 



(9) 



F(y) :-- 



y 3 - y - 1 



We have F(y) > 1, -F(y) — ► 1 as y — > 00, and 

F'(y) = (-V - 3y 2 - l)/(i/ 3 - y - l) 2 < 
so that F is a monotone decreasing function. By ([91) we have 



and so 



^(e^)<--F(v^) 



3#i(e u/2 ) < 2F{\l 7 -f) < 2.1 1 11 



if u > 727. 

From ([6]) we have immediately: 



ue 



- u/2 {n(e u ) - li(e M )} = -1 - V M e~ M/2 li(e pu ) + ^(e u/2 )- + 4tf(e u 

/ — ' u 



ue 



-u/6 



Now is a standard Gaussian distribution, and so 



K{u — uj)du 



LO — T] 



/V 
K(v)dv < 1. 
-n 



If cl; — 77 > 727 then we have the estimate 

K(u - uJ)(^ l (e u,2 )- + Mue~ u/6 )du 

U—T) 

We have replaced the term s\ by s[: 



< ^1111 + 4(w + 77 ) e -(-^)/6 

UJ — Tj 



2.1111 

s i = 

— 77 

Following the steps in Lehman's proof [8j, we are led to a new estimate \R'\: 

\R'\ < s[ + s 2 + s 3 + s 4 + s 5 + s 6 (10) 
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Theorem 2.2. Let A be a positive number such that (3 — | for all zeros 
p = (3 + ry of ((s) for which < 7 < A. Let a, r\ and u be positive numbers 
such that uo — 7] > 727 and 

2/A < 2A/a < 77 < u/2. (11) 

Let K(y) and I(u, rj) be defined as in Theorem 2.1. Then for 27ce < T < A 
we have 

1(10,7]) = -1- —e-^' 2a + B! 

0<|7|<T ^ 

where an upper bound for R' is given by (TU) . 

If the Riemann Hypothesis holds, then conditions (Tl\) and the term s 6 in 
the estimate for R! may be omitted. 



Note. We are setting out to narrow the interval [u — rj, u + rj] . However, 
the inequalities (TTTj) include a lower bound on r]. We explain briefly how this 
lower bound arises. Without the Riemann Hypothesis, Lehman proves, by 
means of several intricate estimates [SI p. 404-406], that the inequalities (TTTT) 
are a sufficient condition for the following crucial estimate: 



E 

h\>A 



K(u 



to )ue 



-u/2 



li (e pu )du 



uj—r) 



< S 6 . 



3 Numerical Results 

In this section we exploit the reduced term s[ to obtain improved numer- 
ical results. We commence with some remarks concerning the accuracy of 
computing by machine. Adopting notation similar to te Riele [IS] we set 

H{T,a,u,) = - e~^ l2a - — = - £ t( 7} a,u) 

0<|7|<T ^ 0<7<T 

where 

t( 7 ,a,c,) = e-^- COS( "V 27 ? Sin( " 7) 

I + 7 2 

Fixing N = 2, 000, 000, we denote by 7* the approximations to the true values 
of 7i, correct to nine decimal places, computed by Odlyzko for 1 < i < N, 
and set T = 1131944.47182487 > 77V. Let H*(T,a,u) be the value obtained 
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by taking the sum up to T using £(7*, a, uS) and let H* M (T,a,u) be the 
result of computing the same sum by machine. The function H* M therefore 
depends on many machine-specific details, including the implementation of 
standard library functions sin, cos and exp; similar considerations apply to 
the evaluation of s[, . . . , sq. In the case of H the quantity we wish to control 
is 

\H - H* M \ <\H - H*\ + \H* - H* M \ 

The terms on the right represent errors arising from the conditioning of H 
upon the 7$ and numerical instability in computing by machine respectively. 
Following [15] we have for 7 < a 

\H(T, a, u) — H*(T, a,u) \ < ^ | 7 - 7*1 • M(j, a, u) 

0<7<T 

with 

_^ /2a f2co to 2 2 4\ 3cj + 8 

M(>y,a,u) = e t / 2a — + +-+ + < 

y 7 7 Z a 7 d 7 z y 7 

Now 14 < 7 < 1131945 < a, and in the region of interest, we have 
uj < 728, so we may use the estimate 

- 1 

\H(T,a,u) -H*(T,a,u)\ < 10" 9 ■ 2192 ■ > - 

and we find numerically that ^2i=x ~ < 12, so that \H — H*\ is bounded 
above by 3 x 10" 5 . 

It remains to address the extent of the additional error arising from ma- 
chine computation of H* and the quantities s[,...,Sq. Let us denote by 
R' M the machine-evaluated sum s'± + • — h Sq. Our initial experiments were 
conducted using Matlab, but to speed sampling of the space of parameters 
(id, a, T], A) we re-implemented matters in C on an x86_64 GNU/Linux system 
using native double precision and routines from the GNU standard mathemat- 
ics library libm. The results for specific valid choices of (u,a,rj,A) with 
H* M — (1 + R' M ) > 1 x 10~ 4 were then re-computed using the arbitrary- 
precision libraries ARPRECpQ and mpfr[3], running at up to 100 digits and 
1024 bits of precision respectively. Upon rounding to 7 decimal places all 
values obtained were in agreement; we are therefore confident that the cu- 
mulative effects of adverse numerical phenomena lie well below the threshold 
of 1 x 10" 6 . 

A simple strategy for selecting suitable (u, a, rj, A) is to make order of 
magnitude estimates of the exponential factors in the terms S2, . . . , sq. To 
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start note that with two million zeros and co near 728 the exponential factor 
in s 5 will be of the same order of magnitude as the leading term s[ when 
a is near 10 11 . A larger a means that r\ may be taken smaller, resulting in 
a narrower interval (co — i],uj + 77), but pushing a to 1.35 x 10 11 seems to 
be as far as one may safely travel in this direction. By repeated subdivision 
and scanning over subintervals of (727, 728) with this a we find H* M — 1 can 
be made to exceed the leading term s[ > 2 x 1CT 3 near co = 727.95202. 
To control the contribution from s 6 we need to take an A the same order 
of magnitude as (aco) 1 ^ 2 , whereupon the constraint a < A 2 forces A near 
1 x 10 7 . We cannot take 77 an order of magnitude smaller than 10~ 5 without 
losing control of s 4 , and the constraint ai] > 2A forces 77 close to 2 x 10~ 4 . 
To keep H* M — 1 — R' M safely above 1 x 10" 4 we chose the following values: 

u = 727.952018 r] = 0.00016 

a = 1.34 x 10 11 A = 1.022 x 10 7 

Then we obtain H* M > 1.006569 and the estimates 

si < 0.002901 s 2 < 10" 49 

S 3 < ID' 746 S 4 < 10" 740 

s 5 < 0.003380 s 6 < 10~ 5 

so that 

I(u,r)) > H-(1 + \R'\) 

> H* M -(1 + R' M ) - 3 x 10" 5 - 1 x 10~ 6 

> 2 x 10" 4 > 

Thus there is a value of u in the interval 

(uj - 77, to + 77) = (727.951858, 727.952178) 
for which n(e u ) > li(e"). Let 

F(u) = ue~ u/2 -{n -li){e u ) 
then we have shown that 

I(co,r])= / K(u-co) ■ F(u)du^ 5 

J ui—r) 

with 5 = 2 x 10~ 4 , therefore since K(u)du is a probability measure, 

pui+ri 

0<5< K{u - uj) ■ sup F(u)du < supF(-u) 

J bJ—T) 
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Now F(u) is continuous except where u happens to be the logarithm of a 
prime, so it follows that for u in some subinterval of (u — rj, u + rj) we have 
F(u) > S, that is to say 



7i{e u ) - li(e u ) > u 



e u/2 ■ 5 > 3.2 x 10 



.151 



We have 



r 



I 

J N 



■N+r 



du r 



\\{N + r) - li(iV) 



\ogN 



log u log 



< < vr(iV + r) -7r(iV) 



and so we obtain 



vr(iV + r) - \\{N + r) > vr(iV) - li(JV) 



r 



\ogN 



This shows that if 7r(e") - li(e n ) > M > then n(e u ) - li(e") will remain 
positive for another [Mu] consecutive integers. Here, Mu > 10 154 . 

Theorem 3.1. There is a value of u in the interval 

(727.951858,727.952178) 

for which 7r(e") — li(e u ) > 3.2 x 10 151 . There are at least 10 154 successive 
integers x between exp(727. 951858) and exp(727. 952178) for which ir(x) > 
li(x) . 
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